Feng Zhang

EvoCut: Multi-Layer Evolution-Aware Visual Token Compression for Efficient Large Vision-Language Models

Jun 01, 2026
Via arXiv

MuChator: Enabling Active Music Discovery via Conversational Music LLMs in Douyin Music

May 26, 2026
Via arXiv

DV-SFT: Direct Vision Supervision for Fine-Grained Visual Understanding

May 26, 2026
Via arXiv

SEP-Attack: A Simple and Effective Paradigm for Transfer-Based Textual Adversarial Attack

May 24, 2026
Via arXiv

RAVE: Re-Allocating Visual Attention in Large Multimodal Models

May 18, 2026
Via arXiv

Revisiting Reinforcement Learning with Verifiable Rewards from a Contrastive Perspective

May 13, 2026
Via arXiv

Evolving-RL: End-to-End Optimization of Experience-Driven Self-Evolving Capability within Agents

May 11, 2026
Via arXiv

Bridging Passive and Active: Enhancing Conversation Starter Recommendation via Active Expression Modeling

May 07, 2026
Via arXiv

Earth-o1: A Grid-free Observation-native Atmospheric World Model

May 07, 2026
Via arXiv

Toward Scalable Terminal Task Synthesis via Skill Graphs

Apr 28, 2026
Via arXiv